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ABSTRACT 

RNase P processes the 5 -end of tRNAs. An essential 
catalytic RNA has been demonstrated in Bacteria, 
Archaea and the nuclei of most eukaryotes; an 
organism-specific number of proteins complement 
the holoenzyme. Nuclear RNase P from yeast and 
humans is well understood and contains an RNA, 
similar to the sister enzyme RNase MRP. In 
contrast, no protein subunits have yet been identified 
in the plant enzymes, and the presence of a nucleic 
acid in RNase P is still enigmatic. We have thus set 
out to identify and characterize the subunits of these 
enzymes in two plant model systems. Expression of 
the two known Arabidopsis MRP RNA genes in vivo 
was verified. The first wheat MRP RNA sequences are 
presented, leading to improved structure models for 
plant MRP RNAs. A novel mRNA encoding the central 
RNase P/MRP protein Poplp was identified in 
Arabidopsis, suggesting the expression of distinct 
protein variants from this gene in vivo. 
Poplp-specific antibodies precipitate RNase P 
activity and MRP RNAs from wheat extracts. Our 
results provide evidence that in plants, Poplp is 
associated with MRP RNAs and with the catalytic 
subunit of RNase P, either separately or in a single 
large complex. 

INTRODUCTION 

RNase P is the ubiquitous and essential endonuclease 
required for tRNA 5'-end maturation; additional func- 
tions involve the cleavage of some mRNAs and 
non-coding RNAs. In most organisms and their organ- 
elles, the enzyme consists of one RNA and up to 



10 protein subunits (1,2). The catalytic centre resides 
in the RNA which can function as ribozyme in vitro, 
although with weak efficiency for more complex organ- 
isms (3,4). The composition of nuclear RNase P from 
yeast and humans is known in detail, but no subunits 
have hitherto been characterized for the nuclear enzyme 
from plants. The primitive plastids from the unicellular 
alga Cyanophora paradoxa contain an RNase P with an 
essential and catalytically active RNA component (5-8); 
plastid and mitochondrial genomes from certain algal 
lineages encode an RNase P RNA (9,10). RNase P 
from chloroplasts and mitochondria of higher plants 
however is a protein enzyme (ProRP) similar to the 
human mitochondrial RNase P (11-13). Previous 
studies on the nuclear enzyme from carrot and wheat 
germ suggested the presence of an essential nucleic acid 
component (14,15). However, the exact molecular com- 
position of the enzyme from higher plants remained 
unsolved. 

In addition to RNase P, the structurally related RNase 
mitochondrial RNA processing (MRP) is present in eu- 
karyote nuclei and contains a similar RNA subunit. Both 
enzymes are localized in the nucleolus and form distinct 
RNP complexes (16,17). Most proteins in the RNase P 
and MRP complexes are identical, but some are specific 
for one of the enzymes (1). RNase MRP cleaves the large 
rRNA precursor at the A3 site and is also involved in 
mitochondrial DNA replication, cleavage of some cyto- 
plasmic mRNAs and production of siRNAs (18,19). In 
earlier studies, an RNase MRP RNA gene from 
Arabidopsis thaliana (AtMRPl) had been identified; the 
expression of the corresponding RNA and of three 
tobacco MRP RNAs in vivo was experimentally verified 

(20) . More recently, in silico searches have detected a 
second, slightly different putative RNase MRP RNA 
gene in Arabidopsis (AtMRP2), and one in Oryza saliva 

(21) ; however, no expression data are available for 
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these RNAs, and the relationship to the enzyme remains 
unclear. 

The largest and one of the central proteins within RNase 
P and MRP is Poplp. In yeast and human cells, Poplp 
directly binds to several proteins and presumably to the 
RNA component in these two separate enzyme complexes 
(1,2,22-24). Its target is possibly the P3 region in RNase P 
and MRP RNA (25), and binding may be facilitated by the 
POP6/POP7 heterodimer (26). In all organisms studied, the 
four highly conserved regions COR1-COR4 are present. 
Mutational analysis revealed that in yeast, the Popl 
domain consisting of COR1 and COR2 is required for 
RNP formation. Conserved residues in COR1 and COR4 
influence RNase P activity, whereas residues in all four 
regions contribute to RNase MRP activity (27,28). In 
A. thaliana, several mRNA splicing variants have been 
annotated for the single gene encoding this protein 
(AtPoplp), raising the question of their expression and 
function. 

To get insight into the holoenzyme composition of plant 
nuclear RNase P and MRP, we have set out to identify the 
subunits of both Arabidopsis enzymes in silico and to inves- 
tigate their expression and relation to the enzyme complex. 
Here, we concentrate on the central protein Poplp and the 
RNAs annotated as MRP RNA. These data are comple- 
mented by functional studies using the established in vitro 
tRNA processing system from wheat germ (15). Our 
expression studies of AtPopl mRNAs reveal a novel 
splicing form encoding a hitherto unknown AtPoplp 
variant. The presence of both Arabidopsis MRP RNAs 
in vivo was verified. Two novel MRP RNA sequences 
from wheat are presented, and improved structural 
models for plant MRP RNAs are suggested. AtPoplp- 
specific antibodies precipitate RNase P activity and MRP 
RNAs from wheat, suggesting a close physical association 
of Poplp with both enzymes in plants. 



MATERIALS AND METHODS 

Chemicals were purchased from Applichem, Carl Roth, 
Merck or Sigma-Aldrich if not stated otherwise, and of the 
highest purity available. Enzymes were from GE 
Healthcare, Roche Applied Science, Fermentas, New 
England Biolabs or Promega and used according to the 
manufacturer's instructions. Radionucleotides were 
purchased from Hartmann Analytic. A. thaliana var. 
Columbia was grown as described (29); young leaves from 
3- to 4-weeks-old plants were used for RNA and protein 
preparations. Unprocessed wheat germ was obtained from 
SynPharma (Augsburg, Germany); S23 extract was 
prepared from defatted germs according to (15) and stored 
at — 20°C. All cloning experiments were performed with gel 
purified vectors and inserts if not stated otherwise. 
Sequencing reactions were analysed on ABI Prism sequen- 
cers by Institut fur Humangenetik (University of 
Wiirzburg), IZKF Leipzig or Seqlab, Gottingen. All se- 
quences were obtained from both strands, using universal 
or internal primers as necessary. Novel nucleotide sequence 
data are available under the accession numbers: FN673552 



(Arabidopsis Popl mRNA variant 3), FN689352 (wheat 
MRP RNA1) and FN689353 (wheat MRP RNA2). 

Oligonucleotide primers 

Oligonucleotides used in this study were purchased from 
Eurogentec. For blunt-end cloning of PCR products, 
phosphorylated amplification primers were used. 

Oligo(dT): dT 16 V; AtPOPl-F: ATGGCTACTACTGCG 
AATGG; AtPOPl-R: TCAAGAATAAGTGTGAATGCC; 
AtPOPl-3'ss: AGGAAGGGGATCTAGGG; Anchor2: CG 
GACGTCACGCGTCGACCTACT 20 ; UAPshort: GTCAC 
GCGTCGACCTAC; Anchor-BRL: GCCACGCGTCGAC 
TAGTACGGGIGGGIGGGIG; AtPOPl-GSPl: GGTCA 
AGTATCAGGAGGAG; AtPOPl-GSP6R: CAATCGCC 
TGAGAAACTGG; AtPOPl-GSP7R: GCCAGTGAGAA 
TAGAGTCG; AtPOPl-GSP9R: GCAATGTGATAGCTT 
GCATC; 3'AtMRPl+2-anti4: AAGCTCCACACACGCG 
TCTC; 3'AtMRPl+2-anti5: GATTCTCTCGTTACCGAG 
GC; 3'MRP-anti3: CCCGTTCAGTTAGTCC; 5'MRP: CA 
GGAAAGTCCCCGG; 5'PlantMRP-CRI: CCAGGAAAG 
TCCCCGG; 3'PlantMRP-CRVanti-W: TAAGCCCCG 
TTCWGTTA; 3'PlantMRP-CRVanti-T: TAAGCCCCGT 
TCTGTTA; WG-MRPl-anti: ACGGACCAAGTGAATA 
AC; WG-MRP2-anti: GGGCTCCACACTCGTCTC. 

Isolation of nucleic acids 

Genomic DNA from plants was essentially prepared 
according to (29) and dissolved in ddH 2 0. 

Total RNA was prepared as described (30) from 5g 
Arabidopsis leaves and dissolved in 2.5 ml TE (10 mM 
Tris-Cl, 1 mM Na 2 EDTA, pH 7.5). Isolation of mRNA 
was performed by chromatography on Oligo(dT)- 
Cellulose. After binding of RNA to the equilibrated 
matrix in binding buffer [TE (pH 7.5), 0.2 % (w/v) SDS, 
0.5 M NaCl], the slurry was transferred into a column and 
unbound RNAs were removed with washing buffer [0.2 % 
(w/v) SDS, 0.25 M NaCl in TE]. Poly(A + )-RNAs were 
eluted with mRNA elution buffer [0.2 % (w/v) SDS in 
TE] at 65°C and precipitated with EtOH. 

Small RNAs were prepared by grinding fresh or frozen 
plant tissue in ice-cold TE buffer followed by extraction 
with one volume of phenol/chloroform. Wheat germ was 
first ground to powder, mixed with two volumes of TE and 
incubated for 20min at 4°C before homogenization and 
phenol extraction. Large nucleic acids were precipitated 
with 2 M NaCl in an ice bath and removed by centrifuga- 
tion. Small RNAs were EtOH precipitated from the super- 
natant, reprecipitated to reduce the salt concentration and 
dissolved in TE. All RNAs used for RT-PCR analysis were 
treated with RNase-free DNase I according to the manu- 
facturer's suggestions, followed by either heat inactivation 
or phenol extraction, and precipitation. 

Cloning of the Arabidopsis Popl domain and analysis of 
AtPopl mRNAs by Rapid Amplification of cDNA Ends 
(RACE) 

The originally annotated AtPoplp coding sequence 
(At2g47290.1; AAB63829.1) was cloned by converting 
poly(A + ) RNAs into cDNAs using the anchored 
oligo(dT) primer dT 16 V and AMV Superscript Reverse 
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Transcriptase (RT). The open reading frame (ORF) was 
amplified with Pfu polymerase and the primers 
AtPOPl-F and AtPOPl-R, and ligated into the Smal site 
of pUC19 to give pAtPOPl. Construction of expression 
vectors for recombinant AtPoplp is described in 
Supplementary Figure S5. The recombinant Popl domain 
used for functional studies is translated from 
pRSET-AtPOPl, which contains the unspliced exons 
1 and 2 (variant 1 u) attached to the T7 promoter and an 
N-terminal His-tag. Translation termination within the 
intron results in a 187 amino acids long recombinant 
protein consisting of the 151 residues long truncated Popl 
domain attached to the affinity tag. 

The 3'-end of AtPopl mRNAs in vivo was identified by 
RACE from cDNA prepared as above; consecutive 
amplifications were performed with the primer pairs 
AtPOP 1 -F + Anchor2, AtPOP 1 -GSP 1 + U APshort and 
AtPOPl-3 , ss+ UAPshort. The transcription start site was 
determined by 5'-RACE using AtPOPl-GSP6R for 
cDNA synthesis. After tailing the DNA strand with 
Terminal Transferase and dATP, nested PCR was per- 
formed with the primer pairs AtPOPl-GSP7R + Anchor2 
and AtPOP 1GSP9R + UAPshort. Cloning of RACE 
products unbiased by insert size was achieved by electro- 
phoretic fractionation of amplificates and separate ligation 
of different size classes into pCR-XL-TOPO or 
pCRII-TOPO, if not stated otherwise. 

Preparation of recombinant AtPoplp 
and antibody production 

Recombinant AtPoplp was produced in Rosetta™ 
(DE3)pLysS (Novagen) transformed with pRSET- 
AtPOPl. Cultures were grown at 37°C in LB medium con- 
taining 100 ug/ml ampicillin, 34 ug/ml chloramphenicol and 
2% glucose. Expression was induced at an OD 60 o of 0.5 by 
adding 1 mM IPTG and growth was continued at 33°C for 
3^4- h. Cell harvest, lysis in 5 ml of denaturing lysis buffer A 
and affinity chromatography on Ni 2+ -NTA-Agarose (2 ml 
column volume) were performed essentially according to the 
manufacturer's protocol (Qiagen); all solutions contained 
1 mM phenylmethylsulfonyl fluoride (PMSF). Protein 
renaturation was achieved on-column by a reverse urea 
gradient (8-2 M). The bound protein was eluted with a 
dual gradient bringing the pH to 4.5 and concomitantly 
raising the imidazol concentration to 250 mM. Fractions 
containing purified protein were identified by SDS-PAGE 
and protein concentrations were determined according to 
(31) with BSA as standard. Pooled fractions were stored in 
elution buffer at 4°C and dialysed against buffer F-GP [20 
mM Tris-Cl pH 7.6, 5mM MgCl 2 , 20 % (v/v) glycerol, 
0.5 mM PMSF] directly before use. For antibody produc- 
tion, the His-tag was removed by digestion with 
Enterokinase and the reaction products isolated by SDS- 
PAGE. Approximately 300 ug of protein in gel pieces were 
used to immunize a rabbit (Eurogentec). Antibodies directed 
against Escherichia coli proteins were removed from the 
serum according to (32). For some applications, the IgG 
fraction was purified by precipitation with 40% 
ammonium sulfate. 



Immunoprecipitation of Poplp-containing complexes 
from wheat germ extract 

All manipulations were performed at 4°C and under 
gentle agitation if not stated otherwise. Antibody specifi- 
city was determined by western blot analysis of 
Arabidopsis proteins as described in Supplementary 
Figure S4. 

For native IP, non-specifically binding wheat proteins 
were first eliminated by overnight incubation of 1 mg 
wheat S23 protein with lOmg hydrated protein 
A-Sepharose beads (Amersham Bioscience) per assay in 
lml buffer F2 (20 mM Tris-Cl pH 8, 10 mM MgCl 2 , 
20 mM DTT, 20% Glycerol, 1 mM PMSF, 1 uM 
Pepstatin, 2uM Leupeptin, 1 U/ml RNasin). The beads 
were removed by centrifugation and this pretreated extract 
was used in immunoprecipitations. Coupling of antibodies 
to protein A-Sepharose and specific binding of antigen 
were performed essentially as described (33). For 
immunoprecipitation, 10 mg beads and 25 ug AtPoplp- 
specific IgG per assay were incubated overnight. After 
coupling and washing, the beads were collected by centri- 
fugation, equilibrated in buffer Fl (20 mM Tris-Cl pH 8, 
5mM MgCl 2 ) and stored at 4°C. The pretreated S23 
extract was bound to the antibody-coupled protein 
A-Sepharose by overnight incubation. The beads were 
washed twice with buffer WIP2 [50 mM Tris-Cl pH 7.5, 
150mM NaCl, 1% (v/v) NP40, 0.5% (w/v) sodium 
deoxycholate] and WIP3 [50 mM Tris-Cl pH 7.5, 
500 mM NaCl, 0.1% (v/v) NP40, 0.05% (w/v) sodium 
deoxycholate], collected by centrifugation, resuspended 
in 30 ul buffer Fl and assayed for RNase P activity. For 
analysis of RNAs present in the bound material, the 
immunoprecipitate was resuspended in 150 ul TE and 
the nucleic acids were recovered by phenol extraction 
and EtOH precipitation. 

Assay of RNase P activity 

Transcription with T7 RNA polymerase and RNA purifica- 
tion by denaturing PAGE was performed as described (34). 
The radioactive transcripts were visualised by autoradio- 
graphy or with a Phosphoimager (Molecular Dynamics), 
eluted and ethanol precipitated. Wheat RNase P activity 
was assayed in buffer Fl with transcripts obtained from 
pSu3-wt cut with Fokl, yielding E. coli pre-tRNA Tyr . 
Reactions contained 5000-10000 cpm of [ 32 P]-labelled 
substrate (17-35 fmol) and enzyme preparations as indicated 
in a total volume of 20 ul. After incubation at 37°C for 
30 min, samples were ethanol precipitated and analysed by 
denaturing PAGE on 8% polyacrylamide (PAA) gels. 
RNase P assays in immunoprecipitated protein complexes 
were initiated by addition of [ 32 P]-labelled substrate to the 
resuspended protein A-Sepharose beads. After incubation at 
37°C for 60 min with gentle agitation, the reaction was 
stopped by addition of 30 ul ddH 2 0 and the slurry extracted 
with 60 ul TE-saturated phenol. After EtOH precipitation, 
reaction products were analysed as above. Control reactions 
with the Ml ribozyme were performed in buffer P2 as 
described (35). 
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Identification of AtMRP RNAs in vivo and determination 
of RNA termini 

The cycling protocol for expression analysis by RT-PCR 
was first optimized by amplification of the two single copy 
MRP genes from genomic DNA with the primers 5'MRP 
and 3'MRP-anti3; a T a of 54°C was used for all further 
experiments. Simultaneous cDNA synthesis from both 
AtMRP RNAs was achieved by reverse transcription of 
total RNA (two independent preparations) with 3'MRP- 
anti3 and M-MLV RT (Promega) at 50°C. After purifica- 
tion {PCR Purification Kit, Roche), cDNAs were 
amplified with 5'MRP and 3'MRP-anti3 and analysed 
by cleavage with EcoRV and electrophoresis on 
non-denaturing 8% PAA gels in TBE. Plasmids carrying 
the two MRP genes (pAtMRPl and 2, respectively) were 
treated accordingly and used as controls. The 5'-ends of 
MRP RNAs were determined by RACE as described for 
Popl mRNA, using 3'MRPanti3 for cDNA generation. 
After dATP tailing and second strand synthesis with 
Anchor2, nested PCR was performed with UAPshort 
and the internal primers 3'MRPl+2-anti4 and 
3'ATMRPl+2-anti5, and the products were cloned into 
pCR-XL-TOPO. 

Identification and sequence determination 
of wheat MRP RNA 

The core region of wheat MRP RNA genes was amplified 
from genomic DNA with 5'PlantMRP-CRI and 
3'PlantMRP-CRVanti-W. The 5' region of the RNA was 
obtained by RACE as described above, using a small RNA 
preparation from wheat germ and 3'PlantMRP-CRVanti-W 
as cDNA primer. After 5' tailing and amplification with 
anchor2 and the specific primers WG-MRPlanti and 
WG-MRP2anti, cDNAs were cloned into pCRII-TOPO for 
sequencing. Identification of MRP RNAs in immunopre- 
cipitates was performed after phenol extraction and DNase 
I treatment of the bound material. RT-PCR was accom- 
plished with 3'PlantMRP-CRVanti-T as cDNA primer, 
followed by amplification with the primer pairs 
5'PlantMRP-CRI + 3'PlantMRP-CRV-anti-T and 5'Plant 
MRP-CRI + WG-MRP-anti, cloning and sequencing. 
Wheat germ RNase P prepared by successive column chro- 
matography (Blue Trisacryl pool) (15) was analysed with the 
same strategy. 

Analysis of RNA populations by non-specific 
RACE analysis 

For identification of unknown RNA species in immunopre- 
cipitates, the RNAs were prepared as above. Following 3' 
polyadenylation with poly(A) polymerase, the RNAs were 
purified with the RNeasy Mini Kit (Qiagen) and reverse 
transcribed using the d(T 16 )V primer and AMV RT 
(Promega). cDNAs were purified through Microcon30 
columns (Amicon), and tailed with dCTP and Terminal 
Transferase. Second strand synthesis was performed with 
the BRL anchor primer; this primer together with Anchor2 
was used for amplification. Reaction products were electro- 
phoretically separated (4% NuSieve GTG Agarose, 



Cambrex), eluted in different size classes, cloned into 
pCRII-TOPO and sequenced. 

Bioinformatic analysis 

The NCBI and EBI tools were used for sequence searches 
and comparisons by BLAST. Promoter analysis of the 
AtPOPl gene was performed with PlantPromDB (36), 
Athena (37) and Agris (38). RNA structures were con- 
structed with mfold (39), and RNAviz was used for their 
graphical representation (40). 

RESULTS 

AtPopl mRNA is present in multiple splicing isoforms, 
possibly encoding several distinct proteins 

Poplp has a size of about lOOkDa in most organisms. The 
original annotation of a Poplp-related mRNA in 
A. thaliana however postulated only two spliced exons 
encoding a 190 residues long protein of 21.5 kDa, 
starting at AUG1 (TAIR: At2G47290.1; AAB63829.1; 
Figure 1, variant 1; Supplementary Figure SI). Attempts 
to clone the corresponding cDNA with AtPOPl-R as 
cDNA primer exclusively yielded the unspliced sequence 
(variant 1 u), encoding a 151 amino acids long ORF 
terminating at an in-frame UAA stop codon at the begin- 
ning of intron 1. This cDNA contains the Popl domain 
except for the last eight conserved residues (27,28) and was 
cloned into pRSET A for heterologous overexpression. 
The resulting 187 amino acids (21.4 kDa) fusion protein 
containing an N-terminal affinity tag was used for func- 
tional studies. 

Two recently annotated longer mRNAs assembled from 
ESTs differ at the splice site positions of introns 1 and 2 
(DQ069804 and DQ069805 on the At2g47290/At2G47300 
locus (41); variants 2a and 2b in Figure 1 ; AT2G47300.2 and 
AT2G47300.3 in Supplementary Figure SI). Both mRNAs 
have a shorter exon 2 than variant 1 . This splicing pattern 
explains the exclusive detection of unspliced RNA (variant 

1 u) during cloning of the originally annotated mRNAl : the 
cDNA primer AtPOPl -R is located within the actual intron 

2 and thus hybridizes only to unprocessed pre-mRNA. 
DQ069804 mRNA encodes an 826 amino acids protein of 
92.5 kDa, similar to Poplp from other organisms 
(Supplementary Figure S3; Ath 2a). DQ069805 mRNA 
potentially encodes two proteins: the first ORF starts at 
AUG1; alternative splicing of introns 1 and 2 leads to 
frameshift and thus to translation termination at UAG1 
within exon 2. No cleavage and polyadenylation signals 
are found downstream of this and the other premature 
stop codons UAA1 and UGA1. A second ORF starting at 
AUG2 in the same exon encodes a putative 73.8 kDa protein 
lacking the Popl domain (Ath 2b, 659 amino acids; 
Supplementary Figure S3). No strong initiation signal 
(Kozak sequence) is present close to AUG2 and internal 
initiation is rare in plants, making translation from this 
start codon unlikely (42,43). 

To resolve the discrepancy between the different mRNA 
annotations, we determined the native RNA termini. 
Unexpectedly, RACE analysis revealed a novel mRNA of 
1214 nt from 5' untranslated region to polyadenylation site. 
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Figure 1. Splicing patterns of AtPopl mRNAs. The 3344 bp long genomic region between AUG1 and UAA2 is shown as a heavy black line; 
distances in bp are indicated by the ruler above the line. Start and stop codons delimiting the expressed coding regions are drawn as solid black 
arrows and those with unclear in vivo function as outlined arrows. The position of the conserved regions (Popl domain and POPLD) is depicted as 
heavy grey lines below the genomic region. The exons of mRNA variants 1-3 are drawn as grey boxes identified by roman numerals and connected 
by dotted lines indicating the length of excised introns. Variant 1 encodes a 190 amino acids protein containing the Popl domain; in the unspliced 
RNA, the ORF terminates at UAA1 after 151 residues. Variants 2a and b are combined into one drawing: 2a encodes an 826 amino acids long ORF 
extending from AUG1 to UAA2. In variant 2b, the alternative splicing events at the borders of exon 2 are indicated by black dots. The resulting 
frame shift creates a 659 residues long ORF between AUG2 and UAA2. The novel variant 3 has an extended exon 4 and encodes a 307 amino acids 
ORF terminating at UGA2. For accession numbers and sequence details, see text and Supplementary Figures SI and S3. 



An ORF of 924 nt extending from AUG1 into the 
annotated intron 4 encodes a 307 amino acids protein of 
34.6kDa (Figure 1, variant 3; Supplementary Figures SI, 
FN673552 and S3, Ath 3). The sequence between exons 1 
and 4 is identical to DQ069804 mRNA (At2G47300.2) 
including the rare GC-AG splice sites of intron 2. The 3'- 
end of the novel mRNA 3 is formed by retaining intron 4, 
which contains an in-frame stop codon (UGA2), 
polyadenylation signals and a cleavage site downstream 
this stop codon. The transcription start site of AtPopl 
mRNAs was established by 5' RACE from cDNA initiated 
within exon 4 and revealed a 5' untranslated region of 
273 nt, slightly longer than for the annotated mRNAs 2a 
and 2b. The upstream promoter region contains several 
transcriptional control elements and factor binding sites 
at about —235 to +80 nt relative to the postulated tran- 
scription start site (Supplementary Figure SI). 
Examination of global expression databases (e.g. TAIR, 
SIGnAL) revealed that RNAs derived from this locus are 
prevalent in fast developing tissue (shoot apex, sperm cells, 
embryo, flower and seed), and are prone to 
stage-dependent expression during mature embryo, petal 
formation and cotyledon stages. The presence of AtPopl 
mRNAs in vivo was further confirmed by Northern blot 
analysis using oligonucleotide probes specific for the two 
alternative splicing variants of intron 2 (Supplementary 
Figure S2). 

Two MRP RNAs are expressed in A. t ha liana and 
contain signature sequences of RNase P RNA 

Arabidopsis contains two genes for RNase MRP RNA; 
data on expression and function are only available for 
MRP1. To examine whether the AtMRP2 gene is func- 
tional, we determined the relative steady-state levels of 
both AtMRP RNAs in vivo. Due to the high similarity 
between these sequences, qPCR analysis was not 
feasible. Instead, an approach was chosen where simultan- 
eous amplification with a single primer pair was followed 
by restriction cleavage, electrophoresis and quantification 
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Figure 2. Steady-state levels of Arabidopsis MRP1 and MRP2 RNA 
in vivo. The following templates were used for amplification of MRP 
sequences: CI: lng genomic DNA from A. thaliana; Rl, R2: 
MRP-specific cDNA from DNase-treated total RNA (two independent 
RNA preparations: 10 ng RNA1 or 14 ng RNA2, respectively); C2, C3: 
0.02 ng pAtMRPl or pAtMRP2; C4: no template; C5, C6: total RNA 
(83 ng RNA1 or 118ng RNA2, as used for cDNA synthesis) treated 
with DNase I. The length of DNA size markers in bp is given to the 
left. The length of the PCR products indicated to the right is as follows: 
MRP1, 178bp; MRPl+EcoRV, 124 + 54bp after EcoRV cleavage; 
MRP2, 184 bp. The ethidium bromide stained gel is shown in 
inverted colour to enhance contrast. 



of the reaction products. Prior to the analysis, PCR con- 
ditions were optimized to amplify the core sequence from 
both MRP genes with approximately equal efficiency 
when the two single-copy genes present in genomic 
DNA were used as template (Figure 2, sample CI). The 
two amplificates can be distinguished by size and by the 
unique EcoRV site in MRP2 (Figure 2, C2 and C3). Using 
two independent Arabidopsis RNA preparations as 
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template, MRP1 was always present in slightly higher 
amounts than MRP2 (Figure 2, Rl and R2). 
Examination of PCR products early in linear amplifica- 
tion phase showed that MRP1 was detectable two cycles 
earlier than MRP2, which correlates to an approximately 
4-fold excess of MRP1 over MRP2 (not shown). This is 
comparable to the ratio between RNase P and MRP 
RNAs in human cells where approximately twice as 
many RNase P RNAs than MRP RNA molecules are 
present (44), but in contrast to yeast, where equal 
amounts of both RNAs including the RNase P RNA pre- 
cursor were detected (45). 

Based on established models for RNase P and MRP 
RNAs from yeast and humans, and on new sequences 
from related plant species (our unpublished data), we 
suggest a novel structure model which differs in several 
aspects from previous depictions of plant MRP RNAs 
(Figure 3). A P7 helix connecting the C- and S-domains 
as in RNase P RNAs is present and in MRP2, a small 
eP15-like structure can be formed. P3 now accommodates 
the internal AU-rich loop which is conserved in most 
known RNase P and MRP RNAs and involved in 



protein binding (21,25,45). The proximal strand of the 
Arabidopsis loop resembles the proposed consensus 
sequence, although it is much smaller than in other eu- 
karyotes (46). Both RNAs conform to the known MRP 
consensus with the exception that L8 is only four- 
membered in plants (21,47). The MRP consensus is 
largely a subset of the eukaryotic RNase P consensus 
sequence; CR II and CR III however are signature 
elements specific for RNase P RNA. They have not 
previously been described in plants, but are present in 
both AtMRP RNAs. In the published structure represen- 
tations, both elements were hidden within a paired region 
of P12. In our alternative model, CR II and CR III are 
juxtaposed within an internal loop structure in P12, 
similar to their location in nuclear RNase P RNAs 
(Figure 3, inset). 

Antibodies specific for AtPoplp recognize two proteins in 
Arabidopsis and precipitate RNase P activity from wheat 
germ extracts 

Prior to performing immunoprecipitation experiments, the 
specificity of the antisera created against the recombinant 
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Figure 3. Secondary structure of Arabidopsis RNase MRP RNAs. The RNase P consensus nucleotides found in both RNAs are circled; the MRP 
consensus is indicated by dots next to the bases. CR II and CR III are indicated by grey shadowing; the alternative P12 structure of AtMRPI is 
depicted in the inset (center). MRP RNA1 is shown with the termini as determined by RACE and MRP2 with the ends suggested by (21). 
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Figure 4. Heterologous immunoprecipitation of wheat germ RNase 
P activity with AtPoplp-specific antibodies. Wheat germ extract was 
subjected to immunoprecipitation as described and the fractions tested 
for RNase P activity. S, supernatant; P, Protein A-Sepharose bound 
precipitate; W, wash fractions. Radiolabeled E. coli pre-tRNA Tyr was 
incubated with the following buffers or enzymes: 1, no treatment; 2, 
incubation in buffer F; 3, E. coli Ml ribozyme in buffer P2; 4, 
untreated wheat germ extract; 5, extract treated as for immunopre- 
cipitation; 6, supernatant after immunoprecipitation; 7-10, wash frac- 
tions; 11, immunoprecipitate; 12+13: same as 6+11, but the coupled 
antibody was pre-incubated with AtPoplp antigen prior to binding of 
the extract; 14: anti-AtPoplp-antibodies as additional control for sub- 
strate stability. 



Popl domain (AtPoplp, variant 1 u) was examined by 
western blotting. Using an enriched Arabidopsis protein 
preparation, a signal at ~ 100 kDa and a weak band at 
35 kDa were visible (Supplementary Figure S4). This cor- 
relates with the ORFs of mRNA variant 2 and the novel 
variant 3 which encode a 92.5 and 34.6 kDa proteins, re- 
spectively. The putative 73.8 kDa ORF encoded by 
mRNA 2b lacks the Popl domain and thus cannot be 
detected by this antiserum. Functional assessment of 
Poplp by in vitro complex formation between the recom- 
binant Popl domain and AtMRP RNAs showed only 
weak binding (Supplementary Figure S5). 

Because Arabidopsis is not amenable to direct biochem- 
ical analysis of RNA processing pathways, the well-estab- 
lished tRNA processing system from wheat germ (15,48) 
was used for coupled immunoprecipitation-activity tests. 
Wheat germ S23 extract was bound to matrix-coupled 
AtPoplp-specific antibodies and assayed for RNase P 
activity (Figure 4). The immunoprecipitate showed signifi- 
cant substrate turnover without degradation (lane 11); no 
activity was detected in the wash fractions (lanes 7-10). 
Saturation of the matrix-coupled antibodies with an excess 
of recombinant antigen prior to incubation with the pro- 
cessing extract completely blocked binding of RNase P 
activity, showing that recognition by the antisera is 
highly specific (compare lanes 12 and 13). The residual 
activity in the supernatant (lane 6) might be derived 
from mitochondrial RNase P, which is active under 
these conditions (15); it is a protein-only enzyme and 
will not be recognized by the antibodies specific for 
Poplp (13). This immunoprecipitation experiment 
shows that AtPoplp and its wheat homologue share anti- 
genic determinants in the Popl domain, and that wheat 
Poplp is associated with a complex containing RNase P 
activity. 



Two MRP RNA variants are present in wheat and are 
associated with purified RNase P preparations 

The co-immunoprecipitation of RNase P activity and the 
RNase P/MRP protein Poplp in the wheat system raised 
the question whether plant nuclear RNase P contains an 
RNA component, as is the case in other eukaryotes. We 
thus used an RNomics approach to detect unknown RNAs 
in the immunoprecipitate and in chromatographically 
purified wheat RNase P fractions. Prior to screening, the 
RNA quality in these preparations was evaluated by 
analysis of wheat MRP RNAs. Due to the lack of related 
sequence data, the core sequence between CR I and CR V 
was determined by genomic PCR. Completion of the RNA 
sequences by RACE revealed two slightly different 
molecules (TaMRPl/2; Figure 5) similar to several short 
wheat ESTs and to the annotated MRP RNA from Oryza. 
L8 deviates from the plant consensus GAGA. P3 is shorter 
than in Arabidopsis and without an internal loop; no CR 
II-CR III consensus is present in P12. 

To get insight into the relation of wheat MRP RNAs to 
the RNase P/MRP complex, the RNA content of partially 
purified RNase P preparations was examined by RT-PCR 
and subsequent sequencing. MRP-specific analysis of the 
immunoprecipitate revealed that only TaMRP2 is 
associated with the complex (Figure 6); in contrast, both 
RNA variants were detected in chromatographically 
purified RNase P (Blue Trisacryl pool) (15). Examination 
of the total RNA population in these preparations by 
non-specific RACE (49) showed that in the immuno- 
precipitate, TaMRP2 and a tRNA were present; cytosolic 
18S rRNA, 5.8S rRNA and the 5' external transcribed 
sequence prevailed (Table 1). Similarly, the chromato- 
graphically purified RNase P preparation contained 18S 
and 5.8S rRNA in addition to both MRP RNAs. The 
unequal distribution of the two MRP RNAs in the 
enzyme preparations can at present only be explained by 
possible differences in complex stability during the lengthy 
immunoprecipitation procedure; the presence of rRNAs 
and tRNA can be attributed to RNase MRP and RNase 
P activity. The prevalence of rRNAs over tRNA and MRP 
RNAs may be due to substrate and/or product binding 
properties of RNase P and MRP. In addition, the 
variable efficiencies for tailing and cDNA synthesis of the 
different RNA species during non-specific RACE may bias 
this qualitative analysis (49). The combined results of RNA 
analysis and activity-coupled immunoprecipitation experi- 
ments suggest that in wheat, Poplp is associated with MRP 
RNA and with RNase P activity. 



DISCUSSION 

The large central protein Poplp common to RNase P and 
MRP was used as a starting point to elucidate the com- 
position of these nuclear enzyme complexes in plants. In 
Arabidopsis, analysis of Popl mRNAs revealed several 
splicing variants, encoding at least two proteins with 
considerable size differences (Figure 1). Of the three 
annotated variants, the original At2G47290 mRNA 
(variant 1) is not detectable in vivo. The more recently 
annotated DQ069804 mRNA encodes a 826 amino acids 
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Figure 5. Secondary structure of RNase MRP RNAs from wheat. The 
two wheat RNAs differ at several isolated positions; only the P19 struc- 
ture is affected by the changes. The sequence of RNA1 is shown; dif- 
ferences in RNA2 are indicated by the corresponding bases positioned 
outside the structure. The alternative structure of P19 is shown in the 
insert. 



ORF similar to Poplp from other organisms (Supple- 
mentary Figure S3, Ath 2a). DQ069805 mRNA differs 
by alternative splicing of introns 1 and 2, leading to frame- 
shift and termination of a short ORF within exon 2; a 
second ORF starting at AUG2 immediately downstream 
encodes 659 residues and lacks the Popl domain, but is 
congruent with DQ069804 from exon 3 to 10 
(Supplementary Figure S3, Ath 2b). Analysis of the differ- 
ential splicing phenomenon by RACE revealed a novel 
mRNA of about 1200 nt, encoding a 307 amino acids 
ORF (Figure 1, variant 3; Supplementary Figure SI, 
FN673552 and Supplementary Figure S3, Ath 3). 
Between exons 1 and 4, the mRNA is identical to 
DQ069804. The 3'-end of this novel mRNA3 however is 
created by retention of intron 4, which provides an 
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Figure 6. Detection of RNase MRP RNAs in wheat germ RNase P 
preparations. The following PCR templates were used: (A) RNAs 
isolated from AtPoplp-specific wheat germ immunoprecipitate. 
Sample 1: cDNA prepared from DNase I-treated RNA; 2: DNase 
I-treated RNA; 3: mock-treated RNA. Nested PCR with 
MRP-specific primers was performed with templates 1-3, with 10 ng 
genomic DNA from wheat (CI) or without template (C2). (B) RNAs 
prepared from purified wheat germ RNase P (Blue Trisacryl pool) were 
treated as described in A. M: DNA marker (Gene Ruler low range, 
Fermentas); relevant sizes in bp are indicated to the right. The 170 bp 
PCR product is specific for wheat RNase MRP RNA. The ethidium 
bromide stained gel is shown in inverted colour to enhance contrast. 



Table 1. RNA species present in the Poplp-specific 
immunoprecipitate 





Gene 


Accession 
number 


Cloned 
fragments 


Nuclear 


5' ETS 


X07841 


10 


rRNA unit 


18S 


AJ272181 


13 




5.8S 


FJ229967 


9 




25 S 


AY049041 


2 


Others 


5 S rRNA 


AY3 16209 


2 




tRNA Asp 


AB005781 


1 




MRP RNA 2 


FN689353 (this article) 


4 




Mito 18S rRNA 


AP008982 


1 



All RNA species detected by non-specific RACE of the wheat 
immunoprecipitate are listed. Only sequences longer than 21 nt were 
considered; the accession numbers are representatives of multiple hits 
for each sequence. All listed hits are from wheat with the exception of 
tRNA Asp (A. thaliana), where no wheat sequence was available. The 
number of independent clones related to one RNA species is given in 
the right column. No sequences located within the Internal Transcribed 
Spacer 1 (ITS1) or ITS2 were detected. 5' ETS: external transcribed 
sequence upstream of 18S rRNA. 



in-frame stop codon (UGA2), polyadenylation signals 
and a cleavage site downstream of this stop codon (50). 
The 307 and 826 amino acids long ORFs starting at 
AUG1 are preceded by a strong initiation signal (43), sug- 
gesting translation in vivo. The 659 residues long ORF 
encoded by mRNA 2b however is lacking a Kozak 
sequence near AUG2, making efficient translation ques- 
tionable. Although weak, expression of the 307 and 826 
amino acids long proteins in vivo is supported by western 
analysis using antibodies against the Popl domain 
(Supplementary Figure S4). The 659 residues long ORF 
does not contain this conserved domain and thus will 
escape detection. 

The four highly conserved regions COR1-COR4 
required for complex formation and enzymatic activity 
of RNase P and MRP are present in the 826 amino 
acids full-length protein variant 2a from Arabidopsis (27) 
(Supplementary Figure S3). The 307 residues long protein 
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encoded by the novel mRNA variant 3 contains COR1 
and COR2, but lacks COR3 and COR4. In yeast, F 89 
and Q 90 in COR1 are important for the assembly of 
RN ase MRP, but not RNase P. Fg9 is conserved as F52 
in AtPoplp and Q 90 is replaced by R 53 , indicating that this 
function is at least partially conserved. Mutations of R233 
and Q234 in COR2 reduce tRNA processing in yeast; the 
equivalent positions in AtPoplp are Sn 7 and Gng. These 
differences to conserved amino acids with a known 
function in yeast suggest a change in the role of Poplp 
during plant RNase P evolution. T242, H243 and R249 
(COR2) are responsible for RNase P RNA maturation 
and assembly of the RNase P and MRP complexes in 
yeast; they are present as T 12 6, H 12 7 and R 133 in 
Arabidopsis. The POP-like domain (POPLD) spanning 
COR3 may be involved in nucleolar localisation of the 
protein; COR4 contains glycine-rich RNA binding 
motifs. Both regions contribute to maturation of yeast 
RNase P RNA, complex formation and activity of both 
enzymes. Most of these functionally important residues 
are present in the AtPoplp variants 2a and 2b; F 82 3 is 
important for RNase P activity in yeast and is conserved 
as F 750 in Arabidopsis. The adjacent G-rich stretch is 
replaced by a basic region. The absence of COR3 and 
COR4 in AtPoplp variant 3 implies that functions 
related to nucleolar localization, RNase P complex forma- 
tion and activity are missing in this protein. The annotated 
hypothetical protein Ath 2b lacks exons 1 and 2 and thus 
the complete Popl domain. The recombinant variant 1 
used for functional studies contains this domain and can 
form complexes with both AtMRP RNAs (Supplementary 
Figure S5). The weak binding observed in these experi- 
ments may be attributed to the lack of the POPLD 
region in this protein. 

Both annotated Arabidopsis MRP RNAs are present 
in vivo in similar amounts, excluding the presence of a 
pseudogene. Our new secondary structure models for 
these RNAs depict all helices conserved in other eukary- 
otes, as well as the hitherto unassigned CR II and CR III 
regions unique to RNase P RNAs. We have also estab- 
lished the first complete sequences of two MRP RNAs 
from wheat, allowing a more reliable structure prediction 
for plant MRP RNAs (Figure 5). The presence of two 
MRP RNAs is not unusual: the Arabidopsis genome 
contains two expressed genes; in Nicotiana, three RNA 
species have been detected; and several other organisms 
possess two genes for RNase P or MRP RNA (20,21). 
Although similar to a previously published genomic 
sequence from rice, the wheat RNAs deviates from the 
postulated L8 consensus (21). They have a short P3 
without an internal loop, suggesting a different mode of 
protein binding in Poaceae. CR II and CR III are present 
in some, but not all known plant MRP sequences, leaving 
open the question of a possible RNase P-like function in 
these RNAs. 

Antibodies specific for the recombinant Popl domain 
from Arabidopsis precipitate RNase P activity and MRP 
RNAs from wheat germ extract and purifed enzyme prep- 
arations, showing that AtPoplp and its wheat homologue 
share antigenic determinants in this domain. Likewise, the 
co-immunoprecipitation of rRNAs and tRNA indicates 



that Poplp is associated with RNase P and RNase MRP 
activity. This may be either in a single large 'processo- 
some-like' complex as described for yeast mitochondria 
(51) or in two similar, but separate complexes as in the 
nuclei of yeast and human cells (16-18,47). The soluble 
activity in the wheat immunoprecipitate is probably 
derived from mitochondrial RNase P, which is a 
protein-only enzyme unrelated to Poplp. In Arabidopsis, 
the same ProRPl protein also provides RNase P activity 
in the chloroplasts. Two similar proteins (ProRP2 and 3) 
localize to the nucleus; their function is still unknown (13). 
Although these nuclear ProRPs may be associated with 
the complex and may be responsible for the catalytic 
activity of nuclear RNase P, we cannot rule out at 
present that an RNA contributes to RNase P activity in 
plant nuclei. Further biochemical and genetic character- 
ization of both enzymes will be required to resolve these 
questions. 
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